Modeling dependency in adaptation of acoustic models using multiscale tree processes

Authors

  • Ashvin Kannan
  • Mari Ostendorf
Abstract

To adapt the large number of parameters in a speech recognition acoustic model with a small amount of data, some notion of parameter dependence is needed. We present a dependence model to relate parameters in a parsimonious framework using a Gaussian multiscale process defined by the evolution of a linear stochastic dynamical system on a tree. To adapt all classes from all adaptation data, we formulate adaptation as optimal smoothing of the tree process. This approach is used to adapt two types of models: Gaussians, and Gaussian processes (segment models) characterized by a polynomial mean trajectory. Recognition results presented on the Switchboard corpus show improvements in supervised and unsupervised modes.
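
The idea of smoothing a Gaussian process on a tree can be illustrated with a small sketch. This is not the authors' implementation: it assumes a scalar state per node, zero-mean offsets from the unadapted parameters, and made-up values for the transition gain `a`, the process noise `q`, the root variance `p0`, and the observation noise `r`. It also computes the smoothed estimate by direct Gaussian conditioning on the joint covariance, which gives the same posterior mean as a recursive sweep over the tree but is only practical for small trees.

```python
import numpy as np

def tree_prior_cov(parent, a, q, p0):
    """Prior covariance of x_child = a * x_parent + w, with w ~ N(0, q).

    parent[s] is the parent index of node s (-1 for the root).
    Nodes must be ordered so that every parent precedes its children.
    """
    n = len(parent)
    M = np.zeros((n, n))          # x = M @ w, one independent driver w per node
    drive = np.full(n, q)         # variance of each driver
    for s in range(n):
        if parent[s] < 0:
            drive[s] = p0         # root driver carries the root variance
        else:
            M[s] = a * M[parent[s]]
        M[s, s] = 1.0
    return M @ np.diag(drive) @ M.T

def smooth(parent, obs, a=0.9, q=0.5, p0=1.0, r=0.2):
    """Posterior mean of every node given noisy observations at some nodes.

    obs maps node index -> observed value y = x + v, with v ~ N(0, r).
    """
    C = tree_prior_cov(parent, a, q, p0)
    idx = np.array(sorted(obs))
    y = np.array([obs[i] for i in idx])
    S = C[np.ix_(idx, idx)] + r * np.eye(len(idx))
    return C[:, idx] @ np.linalg.solve(S, y)

# Hypothetical binary tree: root 0, children 1 and 2, leaves 3..6.
parent = [-1, 0, 0, 1, 1, 2, 2]
# Only leaf 3 has adaptation data; every other node is still updated.
estimates = smooth(parent, {3: 1.0})
```

In this toy tree, the sibling of the observed leaf (node 4) moves further than a leaf in the other branch (node 5), which is the sense in which unobserved classes borrow from observed ones through the tree.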

Similar resources

Modeling dependency between regression classes in MLLR using multiscale autoregressive models

Adapting acoustic models to a new environment is usually realized by considering model transformations that are estimated on the adaptation corpus. Since such a corpus usually contains very little data, the models' Gaussians are most often partitioned into a few regression classes, and all the Gaussians in the same class share the same transformation. It is further possible to increase the number ...

Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition

Two models of statistical dependence between acoustic model parameters of a large vocabulary conversational speech recognition (LVCSR) system are investigated for the purpose of rapid speaker- and environment-adaptation from a very small amount of speech: (i) a Gaussian multiscale process governed by a stochastic linear dynamical system on a tree, and (ii) a simple hierarchical tree-structured pri...

Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition

To recognize non-native speech, larger acoustic/linguistic distortions must be handled adequately in acoustic modeling, language modeling, lexical modeling, and/or decoding strategy. In this paper, a novel method to enhance MLLR adaptation of acoustic models for non-native speech recognition is proposed. In the case of native speech recognition, MLLR speaker adaptation was successfully introduc...

Speaker adaptation using tree structured shared-state HMMs

This paper proposes a novel speaker adaptation method that flexibly controls state-sharing of HMMs according to the amount of adaptation data. In our scheme, acoustic modeling is combined with adaptation to efficiently utilize the acoustic models' sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history rec...

Comparison of acoustic model adaptation techniques on non-native speech

The performance of speech recognition systems is consistently poor on non-native speech. The challenge for non-native speech recognition is to maximize the recognition performance with the small amount of non-native data available. In this paper we report on the acoustic modeling adaptation for the recognition of non-native speech. Using non-native data from German speakers, we investigate how bili...

Publication date: 1997